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Abstract 

Community detection in complex networks is a topic of high interest in many helds. Bipartite 
networks are a special type of complex networks in which nodes are decomposed into two disjoint 
sets, and only nodes between the two sets can be connected. Bipartite networks represent diverse 
interaction patterns in many real-world systems, such as predator-prey networks, plant-pollinator 
networks, and drug-target networks. While community detection in unipartite networks has been 
extensively studied in the past decade, identihcation of modules or communities in bipartite networks 
is still in its early stage. Several quantitative functions proposed for evaluating the quality of bipartite 
network divisions are based on null models and have distinct resolution limits. In this paper, we 
propose a new quantitative function for community detection in bipartite networks, and demonstrate 
that this quantitative function is superior to the widely used Barber’s bipartite modularity and 
other functions. Based on the new quantitative function, the bipartite network community detection 
problem is formulated into an integer programming model. Bipartite networks can be partitioned into 
reasonable overlapping communities by maximizing the quantitative function. We further develop 
a heuristic and adapted label propagation algorithm (BiLPA) to optimize the quantitative function 
in large-scale bipartite networks. BiLPA does not require any prior knowledge about the number of 
communities in the networks. We apply BiLPA to both artihcial networks and real-world networks 
and demonstrate that this method can successfully identify the community structures of bipartite 
networks. 

Keywords: bipartite network; community; quantitative function; integer programming; label 
propagation algorithm 


1 Introduction 

Many real-world systems, such as the Internet, social relationships, food webs, and biological systems, 
can be represented as complex networks W- Community and its detection is an important and valuable 
topic of great interest in network science and many other disciplines from sociology, computer science to 
biology gHZ]. A community in a complex network is often defined as a set of nodes which are densely 
connected with each other, but sparsely connected with nodes outside of the set. 

To evaluate the quality of a network partition or community structure, a proper quantitative function 
is needed. Modularity Q introduced by Newman and Girvan [5] is the most widely used function. Q 
measures the density of edges within communities as compared to a null model. In other words, a partition 
with high modularity should be the one that the density of edges in modules is significantly higher 
than random expectation. Modularity optimization has become a very popular method for community 
detection in the past decade. However, Q has been exposed to have serious resolution limit issues l^fTO]. 
It contains an intrinsic scale that depends on the total number of edges in the network which makes it 
fail to detect dense communities smaller than this scale [5]. Recently, Bagrow m reported that trees 
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and treelike networks can have arbitrarily high values of modularity Q which contradicts the notion of 
communities as being unusually densely interconnected groups of nodes. 

To overcome the above issues, several other quantitative functions have been proposed HMD- For 
example, Li et al. m proposed community density D based on the concept of average degree of a commu¬ 
nity. Ahn et al. |14) proposed the link community detection concept and defined a quantitative function 
based on the average link density. Li et al. |15j improved Ahn’s link community quantitative function. 
Although these quantitative functions [IMl] show certain advantages compared to the modularity Q, 
they are only designed for detecting communities in unipartite networks. 

Although most attention has focused on the community detection in unipartite networks (see and 
references therein), many real-world relations, such as plant-pollinator, order-item, paper-author, and 
event-attendee, are more suitable to be represented as bipartite networks m- In bipartite networks, 
nodes are divided into two disjoint sets (i.e., a bipartite network is composed of two types of nodes), and 
only two nodes from different sets can be connected. In the past several years, community detection in 
bipartite networks has attracted great interests as well [TBl - El] . 

Several quantitative functions and algorithms for community detection in bipartite networks have 
been developed |161I24I[26] . Freeman [26) proposed a projection-based algorithm for bipartite community 
detection, where a bipartite network is projected to a unipartite network. In the projected network, 
two nodes are connected if they are adjacent to one or more common nodes in the bipartite network. 
Obviously, some information of the original bipartite network is lost in the projected network, even if the 
projection is weighted. 

In Guimera’s opinion, a bipartite network community is composed of nodes of the same type |16j . 
In order to measure the connectivity between nodes of different types, a bipartite modularity function 
considering both node types has to be used. Barber proposed a bipartite modularity function based 

on the assumption that a community is a bipartite subgraph composed of nodes of both types. Murata 
and Ikeya [24j proposed another bipartite modularity that uses a pair of two values to represent bipartite 
modularity in both directions, which allows one-to-many correspondence of communities of different 
node types. This bipartite modularity can reveal results consistent with those obtained by Newman’s 
modularity when it is applied to unipartite networks. Suzuki and Wakita [28j modified the version 
of Murata’s bipartite modularity so as to reflect the multi-facet correspondence among communities. In 
addition, several studies have designed algorithms to maximize these modularity functions for community 
detection in bipartite networks [IMDlMllM]. More recently, Chang and Tang proposed a probabilistic 
model to find modules in unipartite, bipartite, and mixture networks [29j . 

So far there has been no agreement on a standard definition for community detection in bipartite 
networks. Since the above definitions of bipartite modularity are all based on the null model used in 
Newman-Girvan modularity, they all have the resolution limit issue mM and fail to detect communities 
smaller than a detectable scale that depends on the size of a network and the interconnectivity of its 
communities. When the scale of the network is sufficiently large, optimizing bipartite modularity favors 
network divisions with groups of small communities merged into larger communities, which may lead to 
ambiguities [20]. 

In this paper, we propose a novel quantitative function - bipartite partition density for evaluating 
community partitions in (both unweighted and weighted) bipartite networks. This quantitative function 
is not a simple generalization of the average link community density in m for bipartite networks. 
In this new quantitative function, we partition the nodes rather than links of bipartite networks into 
communities, i.e., every node must belong to at least one community while some links may not belong to 
any community. We formulate the community partition problem for a bipartite network into an integer 
programming model. In addition to the simple form, we show that the proposed criterion overcomes 
the resolution limit for community detection in bipartite networks based on theoretical analysis and 
numerical tests on artificial and real-world networks. Then we design a heuristic algorithm (BiLPA) for 
efficient community detection in large-scale bipartite networks. We demonstrate their effectiveness with 
applications onto artificial and real-world bipartite networks. 
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2 Partition Density of Bipartite Networks 

A bipartite network G = {U, V, E) is a graph with two disjoint types of nodes U and V, U CiV = 0, such 
that there is no edge connecting two nodes of the same type. Let L{U, V) be the number of edges with 
one node in U and the other node in V, then L{U, V) = \E\ is the total number of edges in bipartite 
network G. 

A complete bipartite network or a biclique G = (U, V, E) is a bipartite network, with each node in U 
adjacent to each node in V. In other words, there are L{U, V) = \U\ x \V\ edges in the complete bipartite 
network. Let B{m, n) denote a biclique with m nodes in U and n nodes in V. B{m, n) has the maximal 
number of edges among all bipartite networks with the same node sets U and V . 

Definition 1. Bipartite graph density The density D{G) of a bipartite network G = {U,V,E) 
is defined as the ratio of the number of edges in G to the number of edges of a corresponding biclique 
B{\U\, |y|), which can be calculated by D{G) = Obviously, for any bipartite network without 

parallel edges, the maximal graph density value is 1 when it is a biclique, and the minimal graph density 
value is 0 when it is an empty bipartite network. 

Definition 2. Community partition density Given a bipartite network G = (C/, V, E), we assume 
that G = {Gi, • ■ • , Gk} is a partition of G into K subgraphs, Gc = {Uc, 14, Ec), c = 1, 2, • • • , AT, then 
= U, = V, and Ec = {cij G E\ui € Uc,Vj € 14}. Let L{Uc,Vc) = \Ec\ be the number of 

edges with one node in Uc and the other node in 14 , then the density of subgraph Gc is defined as 

L{Uc,Vc) 

^ \Uc\x\Vc\- 

The partition density D of a bipartite network is defined as the weighted average of Dc, where the weight 
coefficient is the ratio of the number of edges in each community relative to the total number of edges in 
bipartite network G: 


p \" L{Uc, 14) „ 1 \ ^ A(t/e, 14)^ 

^ - L{U,V) ^ \Uc\ X \Vc\- 

We can see that the maximum value of partition density D is 1 when each community is a biclique 
and there is no edge between the communities, and the minimum value of D is 0 when each community 
is an empty bipartite graph. If a bipartite network is partitioned into overlapping communities, we can 
also use formula [1] to calculate the partition density. 

We can easily extend this definition for weighted bipartite networks. Let G = {U,V,E,W) be a 
weighted bipartite network, where W = {'Wij)pxq is the weight matrix, and Wij is the weight of edge Cij 
taking values in [0,1] without loss of generality. Let W{U,V) be the sum of all edges’ weight between 
node sets U and V, then we can define the weighted partition density Dw of weighted bipartite network 
G = (U, V, E, W) as follows: 

^ W{Uc, K) 1 ^ W[Uc,Vcf 

^ ^ W{U,V) ' " W{U,V) ^ \Uc\ X \Vc\ ' 

By maximizing the partition density D and Dw, we can find the optimal community partition of un¬ 
weighted and weighted bipartite networks respectively. 

Definition 3. Core degree Given a partition of bipartite network G = {Gi, • • • , Gk}, for any node 
Ui € U and any community Gc = {Uc,Vc, Ec), the core degree of node Ui to community Gc is defined 
as CD{ui,Gc) = j similarly, for any node Vj G V and the community Gc, the core degree of 

node Vj to community Gc is defined as CD{vj,Gc) = where N{ui) and N{vj) denote the set 

of nodes adjacent to Ui or Vj in bipartite network G. 

If a node has a high core degree to its community, the node might be an important member of the 
community, so we call it a core node. If a node has a low core degree to its community, it might not be 
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an important member of the community, thus we call it a peripheral node. In a biclique, every node has 
core degree 1, and is a core node. A sparse bipartite network has plenty of peripheral nodes. 

Given a partition G = {Gi, • • • , Gk} of bipartite network G, for a node Ui € U which does not belong 
to any community. If we put node Ui to community Gc = {Uc,Vc, E^), then the partition density of the 
network will increase by 


A D{ui,Gc) 


1 (£(I7„K) + |jVK)nK|)^ 

L{U,VY (\u,\ + l)\V,\ \U,m 


1 GL{Uc,vY + \NiuYnv4Y 


L{U,VY \U,\\V,\ |G,||K| 

1 (2£(i/e, Vc) + |jV(^.) n FcDliVK) n K| 

L{U,V) \Ucm 

1 (2L(f/e, V,) + \N{u,) n Vc\) |iVK) n K| 

L{U,V) |C/c| IFcl 

1 i2LiUc,VY + \Niu,)nVc\)^ 


L{U, y) 


\u. 


-CD{u,,G, 


A D{v,,GY = 1 , {L{U,,VY + \N{vY^Uc\Y _ LjUM^ 


L{U,VY \U,\{\V,\ + l) 

I .(A(Gc,I4) + |A^(^^j)nG,|)2 L{Uc,VcY 


lUcWVcl 


L{U,VY \Uc\\v,\ \Uc\m 

I {2L{U„V,) + ifVfa-) n Uc\)\Niv,) n u^l 
L{U,V) \U,\\V,\ 

I (2L(Gc, Vc) + lA^fa-) n UY) lA^fa-) n Uc 


L{U,V) 


IKI 


\Uc 


( 2 ) 


(3) 


I 

WYv) 


{2L{U,,V,) + \N{vj)nU,\) 
\Vc\ 


GD{v,,Gc) 


From the equation ([2]) (or ([3|)) described above, we can see that, if we put a node Ui (or Vj) into a 
community Gc with high core degree, then the network’s partition density will increase much. When 
there are two communities to which node Ui (or Vj) has the same core degree, we should put it into the 
one that contains its most adjacent nodes |A^(iti)nl4| (or \N{vj)riUc\) in order to obtain larger partition 
density. According to this relationship, we will design a heuristic algorithm (BiLPA) for community 
detection in bipartite networks. 


3 Community Partition Density Improves Resolution Limits 

Although Barber’s bipartite modularity has been widely used for community detection in bipartite net¬ 
works, it may fail to detect communities smaller than a scale even in case where communities are bicliques. 
The proposed partition density D can overcome such an issue to some extent. In the following, we will 
give some examples to demonstrate this point. 

A ring of bicliques A ring of bicliques is a bipartite graph composed of s bicliques connected through 
single edges (see Figure [T]A. and[Tj3). Each biclique is a complete bipartite graph B{m,n) with m nodes 
in U, n nodes in V (m, n > 2) and mn edges. The network has a total of (m -I- n)s nodes and (mn -|- l)s 
edges. 
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Figure 1. (A) The connection relationship of two bicliques in (B). (B) The ring of bicliques. Each 
square indicates a biclique of size m + n, and two adjacent bicliques are connected by one edge. (C) A 
bipartite network consists of four bicliques, two of which are B(n,n) and the other two are 


The network has a clear community structure where each biclique corresponds to a single community, 
but the correct result cannot be obtained by optimizing Barber’s bipartite modularity Q. If we partition 
the network into s communities, then each community corresponds to a single biclique. If we partition the 
network into | communities, then each community consists of two adjacent bicliques. When s > 2mn + l, 
we can easily see that Barber’s bipartite modularity for these two partitions: Qs < (see Appendix 
A). Thus, by maximizing Barber’s bipartite modularity, we cannot partition the network into reasonable 
communities. 

Assuming that s > 4, and it can be exactly divided by an integer fe, fc > 2, now we calculate the 
partition density Ds, D^, D‘ by dividing the network into s, | and | communities, respectively. We 
find that and Ds > with mn > 2, s > 4, which confirm that the partition density Ds has 

larger value. 

Although the above analysis is conducted for the special partition that the k consecutive bicliques 
are considered as a single community, by similar argument we can deduce that such a result is actually 
valid for any kind of grouping bicliques (i.e. any combination of bicliques as communities). In fact, the 
partition density of any other combination of bicliques as communities is less than that of the special 
partition Ds mentioned above. Therefore, these results, along with the fact that the maximum value of D 
is obtained in complete bipartite networks lead to a conclusion that the maximum value of D corresponds 
to the correct partition with each single biclique as a community. 

A bipartite network consisting of bicliques of different sizes Assume that there is a bipartite 
network consisting of four bicliques, two of which are B(n,n), and the other two are B{m,m), for 2 < 
m < n (see Figure 1C). This bipartite network has a natural community structure of four communities, 
where each biclique corresponds to one community. If we partition the bipartite network by optimizing 
Barber’s bipartite modularity Q, we will obtain three communities, with two small bicliques merged into 
one community (Figure 1C). 

Let Qseparate s-nd Dseparate be Barber’s bipartite modularity and our partition density when the 
network is partitioned into four communities corresponding to the four bicliques in Figure 1C, and 
Qmerge and Dmerge denote Barber’s bipartite modularity and our partition density when two small 
bicliques merge into one community. We can see that Qseparate < Qmerge with n > rn? + 1, and the 
two small communities will merge into one community (Appendix A). In contrast, it is easy to see that 
Dseparate > Dmerge with m > 2, which indicates the partition density D based optimization does not 
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have the resolution problem mentioned above (Appendix A). 

The above analysis is conducted for the special partition wherein two smaller bicliques are merged 
into a community with each other biclique as a separate community. With the fact that bicliques have 
maximum partition density, it is easy to see that any other partition has a lower partition density than the 
one with each biclique as a community. Therefore, the optimal value of partition density D corresponds to 
the correct partition. In contrast to bipartite modularity Q, optimizing partition density D can correctly 
detect communities of any size. In summary, optimization of partition density D can often achieve correct 
partitions while Barber’s bipartite modularity not. Thus, we deduce that bipartite partition density D 
can act as a proper quantitative function for community detection in bipartite networks. 


4 Integer Programming Model for Bipartite Community Parti¬ 
tion 


Let U = - ,Up}, V = {vi,V 2 ,--‘ ,Vq} be two disjoint node sets of bipartite network G = 

{U, V, E), and A = {aixj)pxq be the adjacency matrix of node sets U and V in G, where Uij = I indicates 
there is an edge e^- between nodes Ui and Vj, aij = 0 otherwise. Note that \E\ = ^be 

total number of edges in G. Let’s define binary variables Xic, yjc to represent whether node Ui and node 
Vj belong to community Gc or not, respectively: 

_ J 1 ifui € Gc 

( 0 otherwise. 


Vjc = 



if Vj € Gc 
otherwise. 


We can formulate the problem of partitioning bipartite network G into K communities into an integer 
programming model (Model-I): 


1 


K 




(ELi T,Ui (^^jXicVjcf 


(4) 


' K 

'^Xic = l i = 1,2,--- ,p (5) 

C=1 

K 

' '^yjc = ^ j = l,2, •••,g (6) 

{0,1}; i = 1,2, ••• ,p,c= 1,2, ••• ,A: (7) 

.yicejo,!}; j = 1,2,---,g,c= 1,2,---,A: (8) 

where the objective function (4) is to maximize partition density D, constraints (5) and (6) indicate 
that each node must belong to one and only one community, and constraints (7) and (8) ensure that 
the variables are binary ones. We can easily solve the integer programming model by using the Lingo 
software m for small-scale bipartite networks to see if the model can find communities properly. To 
partition a bipartite network into overlapping communities, we can simply relax the constraints (5) and 
(6) to allow a node to belong to more than one community by changing the constraints to > 1 

for i = 1, 2, • • • ,p and Ve > 1 for j = 1, 2, • • • , g. We represent such a model as Model-2. 

For example, the bipartite network in FigurejUA. has graph density 0.4321. We can easily partition it 
into two and three communities by solving Model-1 and obtain the partition density 0.6391 and 0.9429. If 
we partition it into more than 3 communities, the partition density will decrease and thus the maximum 
partition density is 0.9429 which seems reasonable. For the bipartite network in Figure 03, we can 
partition it into two overlapping communities by solving Model-2 with the maximum partition density 1. 
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Figure 2. (A) Three communities of a bipartite network. The nodes in the same community are drawn 
in the same color, and the edges between communities are dotted. (B) Partitioning a bipartite network 
into two overlapping communities. The red node is overlapped by the two communities. (C) A chain of 
heterogeneous bicliques consisting of bicliques B(3,4), B(4,5), B(5,5), B(4,3) and B(6,5). Two adjacent 
bicliques are overlapping with one red node. 


5 BiLPA for Bipartite Community Partition 

To efficiently solve the community partition problem, we further develop a heuristic algorithm (BiLPA) for 
community detection in large-scale bipartite networks, which does not need to pre-specify the number of 
communities K. Raghavan et al. |32j firstly proposed a label propagation algorithm (LPA) for community 
detection in large-scale unipartite networks. Barber and Clark [53] generalized it (LPAb) to optimize 
bipartite modularity. Liu and Murata further introduced an improved label propagation algorithm 

(LPAb-|-) for this task. However, both LPAb and LPAb-|- are designed for maximizing Barber’s bipartite 
modularity. In this paper, we introduce a revised Label Propagation Algorithm (BiLPA) for community 
detection in large-scale bipartite networks by maximizing our partition density D. 

Let A = (aij)pxq be the adjacency matrix of bipartite network G, and and dy. be the degrees of 
node Ui and node vj, respectively. The key idea of BiLPA is as follows. At first, each node Ui G U in 
the bipartite network is initially assigned a unique label, indicating its community membership (without 
causing confusion, we use to indicate node Mi’s label.). Then each node Vj G V acquires a label from 
its neighboring nodes in U, indicating its community membership (we use bj to indicate the node Uj’s 
label). In the following steps, the labels of nodes in U and nodes in V are updated in turn, until a local 
maximum partition density is obtained. The updating rule is based on the following idea: In order to 
find the maximum partition density, every node always chooses to join in a community that leads to the 
maximum increase of the partition density. From equations ([3]) and ([Sj) , we can see that the increase of the 
partition density after node Ui (or Vj) joins in community Gc = {Uc, 14, Ey) is related to the core degree 
CD{ui, Gc) (or CD{vj, Gc)) and the number of node Ui’s (or Vj’s) neighbors belonging to community Gc- 
So we can give two updating rules as follows: (I) If there is only one community to which the node has the 
maximum core degree, then the node joins in this community; (II) If there are two or more communities 
to which the node has maximum core degree, then the node chooses to join the one that contains most 
nodes adjacent to it. These rules can be described by the following formulae respectively. For node Vj, 
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( 9 ) 


For node Ui, 


LB{j) = argmax' 
a 


T,LiaijSig,ri) 

EtiS{9,n) 


P 

yaew ^ arg max^ Oy (5 ( 5 , Ti ). 
g&LB(j) 


( 10 ) 


LR{i) = argmax 
9 


E1=1 5{9. bj) 


q 

= argmax^ay(5(5,&j), 
geLR{i) 


( 11 ) 

( 12 ) 


where g G {1,2, ...,K} is a community label, and 5(x,y) = 1 ii x = y, otherwise S{x,y) = 0. LB[j) and 
LRii) are the label sets of communities corresponding to the maximum core degree. Formula (9) (or (11) 
finds the label set of communities to which vj (or Ui) has maximum core degree. Formula (10) (or (12)) 
finds the label of the community among LB{j) (or LR(i)) where vj (or Ui) has most adjacent nodes in 
it. Formulae (9) (or (11)) and (10) (or (12)) mean that a node should be assigned to a community to 
which the node has maximum core degree and most adjacent nodes. 

The BiLPA pseudo-code is presented in Algorithm BiLPA. 


Algorithm BiLPA: 

• Input 

Input the adjacency matrix A of bipartite network G = (U^V,E), and calculate the number of 
nodes p {p = |C/|) in U and the number of nodes q {q = |F|) in V, respectively. Set the maximum 
number of iterations Iter^nax and the parameter value 9. 

• Output 

Output a community partition and its partition density, and the rearranged matrix according to 
the partition. 

• Step 1. Initialize the labels of nodes in U. For each node Ui, let t = 0 and 0(0) = 0; 

• Step 2. Calculate the number of nodes in U with the same labels. For each label k, calculate the 

number of nodes in U labeled with k using R{k) = o)- 

• Step 3. Update the labels of nodes in V. For each node Vj, update its label with 


LB{j) = argmax 

k 


R{k) 


p 

bnew _ argmax^ayA(fc,Ti). 


• Step 4. Calculate the number of nodes in V with the same labels. For each label fc, calculate the 

number of nodes in V labeled with k using B{k) = bj). 

• Step 5. Update the labels of nodes in U. For each node Ui, update its label with 


Y.%iai]5{k,bj) 


LiTl(i)= argmax 

k B{k) 






<? 

^new _ argmax^ aij(5(fc, 6j). 
kGLR(i) ^ 

• Step 6. Calculate the partition density D(t). If D{t) < D{t — 1) or t < Itermax, go to Step 7, else 
t = t + 1, go to Step 2. 

• Step 7. Calculate the final label of each node and the corresponding partition density. For each 
node Ui in U (or similarly for node Vj in V), calculate its core degree CD(ui, k) to each community k 
and find its maximum core degree maxg CD(ui, g). Calculate the core degree rate RCD(ui, k) with 
RCD(ui,k) = max g ) ' each community label k, determine the community membership 
of Ui by examining whether RCD(ui, k) is larger than 0 or not, and assign the label to node ut in 
U if so. Then calculate the final partition density. 

• Step 8. Rearrange the adjacency matrix A according to the partition result. First, construct a 
matrix B by adjusting the order of the rows of matrix A according to the label of nodes in U. For 
k = 1, • • • ,p, let the R{k) rows of A corresponding to nodes in U labeled with k consist of the 

i?(l) H-h R{k — 1) + 1, R{1) H-h R{k — 1) + 2, • • • , i?(l) H-h R{k — 1) + R{k) rows of matrix 

B, respectively. Then, construct the rearranged matrix C from B according to the label of nodes 
in V. For k = 1, • • • ,p, let the R{k) columns of B corresponding to nodes in V labeled with k 
(matrices B and A have the same column labels) consist of the i?(l) + • • • + R{k — 1) + 1, i?(l) + 
• • • + R{k — 1) + 2, • • • , R{1) + • • • + R{k — 1) + R{k) columns of matrix C, respectively. 

We allow a node to belong to multiple communities in Step 7 which enables us to find overlapping 
nodes in a bipartite network. 

In BiLPA, initialization of each node in U with unique labels requires 0{p) time. At each iteration, 
for each node Ui (or vj), we first group its neighbors according to their labels (0(g) or 0{p)), then 
calculate the core degree to each community {0{q) or 0{p)), and select the community corresponding 
to its maximum core degree and containing its largest number of neighbors {0{q) or 0(p)). Finally, we 
calculate the partition density {0{pq)), where p and q are the numbers of two disjoint sets of nodes in 
the bipartite network. So the total time complexity of each iteration is 0(pg). 

6 Experiment Results 

6.1 A chain of heterogeneous bicliques 

We test our BiLPA algorithm on a type of exemplar bipartite networks, i.e. a chain network of K hetero¬ 
geneous bicliques connected through single nodes (Figure [2P). Assuming that biclique Bi = {Ui,Vi,Ei) 
has Si + ti nodes and \Ei\ = Si x U edges, then the network has a total of iV = + U) — K + 1 

nodes and M = \^i\ edges. The network has a clear bipartite community structure where each 

community corresponds to a single biclique, thus the optimal partition density is 1. Using the BiLPA 
algorithm described above, we can easily detect the optimal partition and identify the overlapping nodes. 

6.2 Real-world Networks 

In this subsection, we validate our algorithm on several real-world networks, including the Southern 
women network, the Scotland corporate interlock network, the Malaria gene-substring network, and the 
protein complex-drug network. We applied BiLPA to these networks with /termax = 100 and used 9 
to control the membership of each node. We can obtain a hard partition with 9 = 1.0 and overlapping 
communities with 0 < 1.0. 

The Southern women network The famous Southern women bipartite network collected in 1930s 
described women’s attendance to social events in the town of Natchez, Mississippi [HUS]- It consists of 
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18 women and 14 events with 89 edges between them. It has become a popular benchmark for discussing 
and exploring bipartite networks in the social science m- Guimera [T^] has analyzed the modules of both 
women and events by three methods, including an unweighted projection method, a weighted projection 
method and a bipartite modularity maximization method. The unweighted projection method did not 
capture the true modular structure of the network. The weighted projection method and the bipartite 
modularity approach capture the two-community structure of the network except that one woman was 
partitioned wrongly. We applied BiLPA with 0 = 1.0 to the Southern women network and identified two 
communities of the network, consisting of {A1,A2,A3,A4,A5,A6,A7,A8,A9; B1,B2,B3,B4,B5,B6.B7,B8 } 
and {A10,A11,A12,A13,A14,A15,A16,A17,A18; B9,B10,B11,B12,B13,B14 }. The corresponding partition 
density is 0.491. We can also obtain four overlapping communities with 0 = 0.8, consisting of {A3, A4, 
A5, A6, A9; B3, B4, B5, B7}, {Al, A2, A6, A7, A8, A9; Bl, B2, B3, B5, B6, B8}, {A10,A11,A12, 
A13, A16; B8, B9, BIO, B12, B13, B14}, and (All, A14, A15, A17, A18; B9, BIO, Bll} respectively 
(Figure 3). The corresponding partition density is 0.601. If we use a smaller 9 (e.g., 9 = 0.7), we can also 
partition the network into four communities. In such cases, more nodes will be determined as overlapping 
ones, and the partition density will be much larger. 



Figure 3. The four overlapping communities of the Southern women network detected by BiLPA. The 
red nodes A6, A9, All, B3, B5, B8, B9, BIO are overlapping ones. 


The Scotland corporate interlock network The Scotland corporate interlock network consisting 
of 244 nodes and 356 edges describes the corporate interlocks in Scotland in the beginning of the twentieth 
century (1904-1905). The 244 nodes are divided into two parts, where 136 nodes represent the board 
members who held multiple directorships, and 108 nodes denote the firms. The edges exist between each 
firm and its board members. The Scotland corporate interlock network is not connected. The largest 
component of the network contains 131 directors and 86 firms, which form 36 communities [39] . 

We applied our BiLPA algorithm to the largest component of the network. We determined 42 com¬ 
munities (Figure 4A) and obtained a partition density 0.524 with 9 = 1.0. We can also determine 23 
nodes belong to multiple communities with 9 = 0.8. However, by optimizing Barber’s modularity, the 
network can only be partitioned into 20 communities, and some small communities are merged into large 
communities. Obviously, optimizing our partition density with BiLPA algorithm provides higher quality 
community structure than other methods. 

The Malaria gene-substring network The Malaria gene-substring network comes from the malaria 
parasite P. falciparum. The parasite can evade the human immune system via a protein camouflage 
encoded in var genes |491I50| . In order to create novel camouflages, var genes frequently recombine, 
which amounts to the constrained splicing and shuffling of genetic substrings, giving rise to community 
structures naturally [50] . The network consists of two types of nodes corresponding to 297 genes and 806 


10 

































constituent substrings respectively. Each substring connects to a gene in which it is present. There are 
2965 edges in the network and the link density of the bipartite network is 0.012. 

In |50j . the network is partitioned into three coarse communities. We applied our BiLPA algorithm 
to the network and identified 158 hard communities with a maximal partition density 0.41 using 6 = 1.0 
(Figure 4B). From Figure 4B, we can see that this network is very sparse and thus partitioning it into 
three communities is unreasonable. Our method obtains very detailed communities. For example, BiLPA 
detects a biclique community consisting of four genes and 14 substring (the circled community in Figure 
4B), while it was merged into a very large one in [50]. This result indicates that BiLPA can find better 
community structure and conquer the resolution limit showed in m- 

The protein complex-drug network The bipartite network of protein complexes and drugs in¬ 
vestigated by Nacher and Schwartz m contains 1419 nodes (680 drugs and 739 complexes) and 3690 
edges. Nacher and Schwartz identified network communities in both the bipartite network itself and the 
projected networks m- The number of modules in the drug projection network is 23, while the number 
of modules in the protein complex projection network is 17. Using the bipartite network itself, they 
detected 48 modules of drugs and 42 modules of complexes. 

We applied our BiLPA to this network and partitioned it into 122 communities with partition density 
0.57 (Figure 5A). We can see that each small disconnected subgraph is identified as one community. The 
giant connected subgraph is partitioned into about 90 communities. This network is very sparse and most 
communities are very sparsely connected with others, but densely connected within themselves. For ex¬ 
ample, we demonstrated two distinct complex-drug communities in Figure 5B and Figure 5C. The circled 
community in Figure 5B consists of four complexes (CORUM_0441, CORUM_1054, CORUM_2124 and 
CORUM_5862) and 40 drugs. Most drugs are fully connected with these four complexes. Another circled 
dense community in Figure 5C consists of five complexes (CORUM_1223, CORUM_2242, CORUM_4158, 
CORUM_5189, CORUM_5526) and 16 drugs. The bipartite communities reveal that some drugs target to 
a same set of biological complexes. This is valuable for prediction of the detailed mechanisms of unknown 
ones from the well-known drugs’ functions. 

7 Conclusion and discussion 

Community structure is one of the main characteristics of bipartite networks and is valuable for under¬ 
standing the organization and functions of such networks. In this paper, we propose a new quantitative 
function — bipartite partition density — for evaluating the community partition of bipartite networks. 
We show that our quantitative function can conquer the serious resolution limit issues of modularity-based 
functions. We first formulate the community detection problem for bipartite networks into an integer 
nonlinear programming model. Furthermore, we design a fast heuristic algorithm BiLPA for solving 
the community detection problem and conduct validation experiments on some artificial and real-world 
networks. The computational results demonstrate that our model and algorithm can detect overlapping 
communities in bipartite networks. Moreover, the quantitative function proposed in this paper can be 
easily extended to the community detection in weighted bipartite networks. 
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Appendix A: Community Partition Density Improves Resolution Limits 

A ring of bicliques The ring of bicliques has a distinct community structure where each biclique 
corresponds to a single community, but the correct result cannot be obtained by optimizing Barber’s 
bipartite modularity Q. If we partition the network into s communities, and each community corresponds 
to a single biclique, then Barber’s bipartite modularity Q is 

mn mn -I-1 mn + 1 

s(mn + 1) s(mn 1) s(mn -|- 1) 

mn 1 
mn -b 1 s 

If we partition the network into | communities, and each community consists of two adjacent bicliques, 
then Barber’s bipartite modularity Q is 

2mn + 1 
s(rnn 1) 

2mn -b 1 2 

2{mn -b 1) s 

When s > 2{mn -b 1), we have 

mn 1 2mn -b 1 2 

* ^ mn -b 1 s 2{mn -b 1) s 

_ 1 1 
s 2{mn + 1) 

< 0 . 

The partition density Dg of the natural partition can be easily and analytically calculated as follows: 

(mnY' \ mn 

-• s =--. 

mn ) mn +1 


Dg = 


1 


s(mn -b 1) 



2mn + 2 2mn + 2 \ s 
s{mn -b 1) s{mn +1) J 2 
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On the other hand, the partition density of the partition in which the k consecutive bicliques are 
considered as a single community is 


Then, 


D± = 
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i(mn + 1) 
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Supposing mn > 2, s > 4, when fc = 2, 


Ds-D± = 
> 0 . 


(mn)(fc-(l + ^)^) 
k{mn + 1 ) 

{mn){2 — (1 


2mn 




2{mn + 1 ) 


When fc > 3, 


Ds-Ds = 
® 3 


(mn)(fc-(l + |^) 2 ) 


k{r 


1 ) 


> 


(mn)(fc-(1 + ^)2) 


k{r 


1 ) 


> 


{mn){k - (1 + 5 )^) 
k{mn + 1 ) 


> 0 . 

A bipartite network consisting of bicliques of different sizes Suppose there is a bipartite 
network consisting of four bicliques, two of which are B{n,n), and the other two are B{m,m), for 
2 < m < n. This bipartite network has a natural community structure of four communities, where each 
biclique corresponds to one community. If we partition the bipartite network by optimizing Barber’s 
bipartite modularity Q, we will obtain three communities, where two small bicliques merge into one 
community. We can calculate their corresponding Q as follows: 
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If n > m? + 1, then Qseparate ~ Qmerge < 0. This means that modularity Q-based optimization will 
merge two smaller bicliques B(rn,n)s into one community. 

- : that partition density D-hased optimization does not have this resolution 


In the following, we prove 
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Substrings 


Figure 4. The communities of the Scotland corporate interlock network (A) and the Malaria 
gene-substrings network (B) detected by BiLPA. 
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Figure 5. The communities of the protein complex-drug network detected by BiLPA (A) and two 
illustrative community examples in (B) and (C). These heatmaps were depicted based on the 
rearranged matrices according to the output from BiLPA 
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